Zhen Dong

AnchoredDream: Zero-Shot 360° Indoor Scene Generation from a Single View via Geometric Grounding

Jan 26, 2026
Via arXiv

Expert Knowledge-Guided Decision Calibration for Accurate Fine-Grained Tree Species Classification

Jan 23, 2026
Via arXiv

Unleashing the Capabilities of Large Vision-Language Models for Intelligent Perception of Roadside Infrastructure

Jan 15, 2026
Via arXiv

SVII-3D: Advancing Roadside Infrastructure Inventory with Decimeter-level 3D Localization and Comprehension from Sparse Street Imagery

Jan 15, 2026
Via arXiv

WHU-PCPR: A cross-platform heterogeneous point cloud dataset for place recognition in complex urban scenes

Jan 10, 2026
Via arXiv

NVIDIA Nemotron 3: Efficient and Open Intelligence

Dec 24, 2025
Via arXiv

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Dec 23, 2025
Via arXiv

Can World Models Benefit VLMs for World Dynamics?

Oct 01, 2025
Via arXiv

WHU-STree: A Multi-modal Benchmark Dataset for Street Tree Inventory

Sep 16, 2025
Via arXiv

ButterflyQuant: Ultra-low-bit LLM Quantization through Learnable Orthogonal Butterfly Transforms

Sep 11, 2025
Via arXiv